NBA: defensive distillation for backdoor removal via neural behavior alignment

نویسندگان

چکیده

Abstract Recently, deep neural networks have been shown to be vulnerable backdoor attacks. A is inserted into via this attack paradigm, thus compromising the integrity of network. As soon as an attacker presents a trigger during testing phase, in model activated, allowing network make specific wrong predictions. It extremely important defend against attacks since they are very stealthy and dangerous. In paper, we propose novel defense mechanism, Neural Behavioral Alignment (NBA), for removal. NBA optimizes distillation process terms knowledge form samples improve performance according characteristics defense. builds high-level representations behavior within order facilitate transfer knowledge. Additionally, crafts pseudo induce student models exhibit behavior. By aligning from with benign teacher network, enables proactive removal backdoors. Extensive experiments show that can effectively six different outperform five state-of-the-art defenses.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extending Defensive Distillation

Machine learning is vulnerable to adversarial examples: inputs carefully modified to force misclassification. Designing defenses against such inputs remains largely an open problem. In this work, we revisit defensive distillation—which is one of the mechanisms proposed to mitigate adversarial examples—to address its limitations. We view our results not only as an effective way of addressing som...

متن کامل

On the Effectiveness of Defensive Distillation

We report experimental results indicating that defensive distillation successfully mitigates adversarial samples crafted using the fast gradient sign method [2], in addition to those crafted using the Jacobian-based iterative attack [5] on which the defense mechanism was originally evaluated.

متن کامل

Classifying NBA Offensive Plays Using Neural Networks

The amount of raw information available for basketball analytics has been given a great boost with the availability of player tracking data. This facilitates detailed analyses of player movement patterns. In this paper, we focus on the difficult problem of offensive playcall classification. While outstanding individual players are crucial for the success of a team, the strategies that a team ca...

متن کامل

Image alignment via kernelized feature learning

Machine learning is an application of artificial intelligence that is able to automatically learn and improve from experience without being explicitly programmed. The primary assumption for most of the machine learning algorithms is that the training set (source domain) and the test set (target domain) follow from the same probability distribution. However, in most of the real-world application...

متن کامل

Defensive Distillation is Not Robust to Adversarial Examples

We show that defensive distillation is not secure: it is no more resistant to targeted misclassification attacks than unprotected neural networks.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Cybersecurity

سال: 2023

ISSN: ['2523-3246']

DOI: https://doi.org/10.1186/s42400-023-00154-z